Re Item V

Reasoned statement under Article 35(2) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 



1. D1: MCKINNEY: 'Digital Audio Compression Standard AC-3', 20 December 1995,
ADVANCED TELEVISION SYSTEMS COMMITTEE, XP002075746,
cited in the application

2. The application belongs to the field of digital signal processing: samples of an 
audio signal are sent in frames, subdivided in blocks or sets. The samples are 
represented in mantissa-exponent pairs, and the goal is to code the exponents in 
an effective way. 

3. Claims 1, 16 and 29 involve an inventive step: the method proposed in D1 does
not foresee the use of different differential coding limits for different exponent sets, 
which results in a more effective coding, nor would this improvement have been 
obvious for the skilled person. 

4. The dependent claims concern advantageous embodiments of the subject-matter 
of the independent claims and thus their subject-matter is also considered to be 
novel and inventive.

Re Item VII

Certain defects in the international application 



1 The features of the claims are not provided with reference signs placed in 
parentheses (Rule 6.2(b) PCT). 

2 The independent claims are not in the two-part form in accordance with Rule 
6.3(b) PCT, which in the present case would be appropriate, with those features 
known in combination from the prior art (document D1) being placed in a 
preamble (Rule 6.3(b)(i) PCT) and with the remaining features being included in a 
characterising portion (Rule 6.3(b)(ii) PCT).

Re Item VIII

Certain observations on the international application



1 All the features of claim 16 are present in claim 1. 

The aforementioned claims therefore lack conciseness. Moreover, lack of clarity of 
the claims as a whole arises, since the plurality of independent claims makes it 
difficult, if not impossible, to determine the matter for which protection is sought, 
and places an undue burden on others seeking to establish the extent of the 
protection. 

Hence, claims 1 and 16 do not meet the requirements of Article 6 PCT. 

2 Claims 13, 26 and 35 are not clear, since it is not clear what the quantity 'i' 
represents. 

3 It seems that in claims 12, 25 and 34 and in the corresponding passages of the
description the wording

"f(Yi) is +1 if Y>0..."

should read

"f(Yi) is +1 if yi>0...".

4 The text at the beginning of the last line of pages 3 and 5 is partially unreadable. 

5 The text at the beginning of page 3 does not match that at the end of page 2. 

6 The text at the beginning of page 4 does not match that at the end of page 3.

Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial
applicability; citations and explanations supporting such statement

1. Statement

Novelty (N)                      Yes: Claims 1-37
                                 No:  Claims
Inventive step (IS)              Yes: Claims 1-37
                                 No:  Claims
Industrial applicability (IA)    Yes: Claims 1-37
                                 No:  Claims

2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 

see separate sheet

computational resources.
Summary of the Invention 



In accordance with the present invention, there is provided a method for processing data in
an audio data encoder, the data comprising a sequence of exponent sets each comprising a
plurality of exponents, comprising the steps of:

determining a first variation of exponent values within a first exponent set;
determining a second variation of exponent values between said first exponent set and
each subsequent exponent set in said sequence; and

assigning an exponent coding strategy to the first exponent set based on the determined
first and second variations, wherein the exponent coding strategy is assigned from a plurality
of exponent coding strategies having different differential coding limits.

In one form of the invention, the method includes selecting one of said subsequent exponent
sets on the basis of said first variation and assigning an exponent re-use coding strategy to the
selected exponent set and any exponent sets in said sequence between the first exponent set
and the selected exponent set.

Preferably the method includes a step of determining a second variation between consecutive
exponents in said first exponent set, wherein the exponent coding strategy for said first
exponent set is selected on the basis of said first and second variations.



The step of selecting the exponent coding strategy for the first exponent set is preferably
performed using neural network processing. In that case, the neural network processing may
include a feature extraction stage in which said sequence of exponent sets is processed to
determine said first variation values, a weighted routing stage in which said first variation
values are weighted according to predetermined weighting values and routed to inputs of a
first neural layer, a selection stage in which an output of the first neural layer is selected, and
an output processing stage in which a coding strategy is assigned to said first exponent set
based on the output of said selection stage and said second variation.

where $f(\gamma_i)$ is +1 if $\gamma \ge 0$, else it is 0,

Y represents outputs of the first neural layer,
T are threshold values determined during a training phase,
w are weighting values determined during the training phase, and
b is the number of exponent sets in the sequence comprising said data.

In one form of the invention, the selection stage comprises selecting an output $y_a$ of the first
neural layer such that $y_a = 1$ and a is maximum for $i < a < b$. The plurality of exponent
coding strategies may comprise strategies $S_1, S_2, \ldots, S_c$, where $c \le b$, corresponding to
respective differential coding limits 1, 2, ..., c, wherein the exponent coding strategy
assigned to said first exponent set $E_i$ is selected according to

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1]$$
where $\sigma(E_i) = \mathrm{floor}\big((\sum_j |e_{i,j+1} - e_{i,j}|/n) + 0.5\big)$.

The exponent sets in the sequence between i+1 and a, inclusive, can then be assigned the
exponent re-use coding strategy.

The present invention also provides a method for coding audio data comprising a sequence
of exponent sets each comprising a plurality of exponents, comprising the steps of:
determining a first variation of exponent values between a first exponent set in the
sequence and each subsequent exponent set in said sequence;

selecting an exponent coding strategy for said first exponent set from a plurality of
exponent coding strategies on the basis of said first variation; and
coding said first exponent set according to the selected exponent coding strategy,
wherein each of the plurality of exponent coding strategies corresponds to a different
differential coding limit.



The invention also provides a digital audio encoder in which audio data is transformed into
coefficients having mantissas and exponents arranged in a sequence of sets, having:

a first variation processor coupled to receive the exponents of sets from said sequence
and determine a first variation of exponent values between a first set and a plurality of
subsequent sets in the sequence;
a second variation processor coupled to receive the exponents of said first set and
determine a second variation between consecutive exponent values within said first set; and
a neural network processor coupled to receive said first and second variations and
select and assign an exponent coding strategy to said first set from a plurality of coding
strategies on the basis of said first and second variations, wherein each of the plurality of
coding strategies corresponds to a different differential coding limit.



Brief Description of the Drawings

The invention is described in greater detail hereinafter, by way of example only, with
reference to several embodiments, and having regard to the accompanying drawings, wherein:

Figure 1 is a diagrammatic illustration of the data structure of an encoded audio data
bit stream, having frames and blocks;

Figure 2 is a functional block diagram of an audio encoder;

Figure 3 is a diagrammatic illustration of mapping of exponent sets to coding
strategies; and

Figure 4 is a diagrammatic illustration of a neural network system for selection of
exponent coding strategies.



Detailed Description of the Preferred Embodiments 

Claims:

1. A method for processing data in an audio data encoder, the data comprising a sequence
of exponent sets each comprising a plurality of exponents, comprising the steps of:

determining a first variation of exponent values within a first exponent set;

determining a second variation of exponent values between said first exponent set and
each subsequent exponent set in said sequence; and

assigning an exponent coding strategy to the first exponent set based on the determined
first and second variations, wherein the exponent coding strategy is assigned from a plurality
of exponent coding strategies having different differential coding limits.

2. A method as claimed in claim 1, including a step of coding said first exponent set 
according to the assigned exponent coding strategy. 

3. A method as claimed in claim 2, including a step of assigning an exponent coding
strategy to at least one subsequent exponent set based on the corresponding determined second
variation.

4. A method as claimed in claim 3, wherein the plurality of exponent coding strategies
includes an exponent set re-use strategy which is assigned to the at least one subsequent
exponent set.

5. A method as claimed in claim 4, including a step of coding said first exponent set and
said at least one subsequent exponent set according to the corresponding assigned coding
strategies.

6. A method as claimed in claim 1 or 5, wherein the steps of determining the first and
second variations are performed utilising neural network processing.

7. A method as claimed in claim 6, wherein the neural network processing includes first
and second neural layers.

8. A method as claimed in claim 6, wherein the neural network processing comprises a
feature extraction stage in which said sequence of exponent sets is utilised to determine said
second variations, a weighted routing stage in which said second variations are weighted
according to predetermined weighting values and routed to inputs of a first neural layer, a
selection stage in which an output of the first neural layer is selected, and an output
processing stage in which a coding strategy is assigned to said first exponent set based on said
first variation and the output of said selection stage.

9. A method as claimed in claim 8, wherein a coding strategy is assigned to at least one
subsequent exponent set on the basis of the output of said selection stage.

10. A method as claimed in claim 9, wherein the coding strategy assigned to the at least
one subsequent exponent set is an exponent re-use strategy.

11. A method as claimed in claim 8, wherein the feature extraction stage comprises
determining

$$\mathrm{Adiff}(E_i, E_j) = \Big(\sum_m |e_{i,m} - e_{j,m}|\Big)/n$$
where Adiff is said second variation,

$E_i$ is said first exponent set and $E_j$ is a subsequent exponent set with j > i,

n is an integer representing the number of exponents in a said set of exponents,
m = 0, 1, 2, ..., n-1.

12. A method as claimed in claim 11, wherein the processing carried out by the weighted
routing stage and first neural layer includes determining

the operator $\Gamma[\cdot]$ is defined as:

where $f(\gamma_i)$ is +1 if $\gamma \ge 0$, else it is 0,

Y represents outputs of the first neural layer,
T are threshold values determined during a training phase,
w are weighting values determined during the training phase, and
b is the number of exponent sets in the sequence comprising said data.

13. A method as claimed in claim 12, wherein the selection stage comprises selecting an
output $y_a$ of the first neural layer such that $y_a = 1$ and a is maximum for $i < a < b$.

14. A method as claimed in claim 13, wherein the plurality of exponent coding strategies
comprises strategies $S_1, S_2, \ldots, S_c$, where $c \le b$, corresponding to differential coding limits
1, 2, ..., c.

15. A method as claimed in claim 14, wherein the exponent coding strategy $S_\gamma$ assigned
to said first exponent set is selected according to

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1]$$
where $\sigma(E_i) = \mathrm{floor}\big((\sum_j |e_{i,j+1} - e_{i,j}|/n) + 0.5\big)$.

16. A method for coding audio data comprising a sequence of exponent sets each
comprising a plurality of exponents, comprising the steps of:

determining a first variation of exponent values between a first exponent set in the
sequence and each subsequent exponent set in said sequence;
selecting an exponent coding strategy for said first exponent set from a plurality of
exponent coding strategies on the basis of said first variation; and

coding said first exponent set according to the selected exponent coding strategy,
wherein each of the plurality of exponent coding strategies corresponds to a different
differential coding limit.

17. A method as claimed in claim 16, including selecting one of said subsequent exponent
sets on the basis of said first variation and assigning an exponent re-use coding strategy to the
selected exponent set and any exponent sets in said sequence between the first exponent set
and the selected exponent set.

18. A method as claimed in claim 16 or 17, including a step of determining a second
variation between consecutive exponents in said first exponent set, wherein the exponent
coding strategy for said first exponent set is selected on the basis of said first and second
variations.

19. A method as claimed in claim 16, wherein the step of selecting the exponent coding
strategy for said first exponent set is performed utilising neural network processing.

20. A method as claimed in claim 18, wherein the step of selecting the exponent coding
strategy for said first exponent set is performed utilising neural network processing.

21. A method as claimed in claim 20, wherein the neural network processing comprises
a feature extraction stage in which said sequence of exponent sets is processed to determine
said first variation values, a weighted routing stage in which said first variation values are
weighted according to predetermined weighting values and routed to inputs of a first neural
layer, a selection stage in which an output of the first neural layer is selected, and an output
processing stage in which a coding strategy is assigned to said first exponent set based on the
output of said selection stage and said second variation.

22. A method as claimed in claim 21, wherein a coding strategy is assigned to at least one
subsequent exponent set on the basis of the output of said selection stage.

23. A method as claimed in claim 22, wherein the coding strategy assigned to the at least
one subsequent exponent set is an exponent re-use strategy.

24. A method as claimed in claim 21, wherein the feature extraction stage comprises
determining the first variation values according to
$$\mathrm{Adiff}(E_i, E_j) = \Big(\sum_m |e_{i,m} - e_{j,m}|\Big)/n$$
where Adiff is said first variation,

$E_i$ is said first exponent set and $E_j$ is a subsequent exponent set with j > i,
$E_i = (e_{i,0}, e_{i,1}, e_{i,2}, \ldots, e_{i,n-1})$,

$E_j = (e_{j,0}, e_{j,1}, e_{j,2}, \ldots, e_{j,n-1})$,

n is an integer representing the number of exponents in a said set of exponents,
m = 0, 1, 2, ..., n-1.

25. A method as claimed in claim 24, wherein the processing carried out by the weighted
routing stage and first neural layer includes determining

the operator $\Gamma[\cdot]$ is defined as:

where $f(\gamma_i)$ is +1 if $\gamma \ge 0$, else it is 0,

Y represents outputs of the first neural layer,
T are threshold values determined during a training phase,
w are weighting values determined during the training phase, and

b is the number of exponent sets in the sequence comprising said data.

26. A method as claimed in claim 25, wherein the selection stage comprises selecting an
output $y_a$ of the first neural layer such that $y_a = 1$ and a is maximum for $i < a < b$.

27. A method as claimed in claim 26, wherein the plurality of exponent coding strategies
comprises strategies $S_1, S_2, \ldots, S_c$, where $c \le b$, corresponding to respective differential
coding limits 1, 2, ..., c.

28. A method as claimed in claim 27, wherein the exponent coding strategy assigned
to said first exponent set $E_i$ is selected according to

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1]$$
where $\sigma(E_i) = \mathrm{floor}\big((\sum_j |e_{i,j+1} - e_{i,j}|/n) + 0.5\big)$.

29. A digital audio encoder in which audio data is transformed into coefficients having
mantissas and exponents arranged in a sequence of sets, having:

a first variation processor coupled to receive the exponents of sets from said sequence
and determine a first variation of exponent values between a first set and a plurality of
subsequent sets in the sequence;

a second variation processor coupled to receive the exponents of said first set and
determine a second variation between consecutive exponent values within said first set; and
a neural network processor coupled to receive said first and second variations and
select and assign an exponent coding strategy to said first set from a plurality of coding
strategies on the basis of said first and second variations, wherein each of the plurality of
coding strategies corresponds to a different differential coding limit.

30. An audio encoder as claimed in claim 29, wherein the neural network processor also
selects and assigns an exponent coding strategy to at least one of the subsequent sets.

31. An audio encoder as claimed in claim 30, wherein the exponent coding strategy
assigned to the at least one subsequent set is an exponent re-use strategy.

32. An audio encoder as claimed in claim 29, wherein the neural network processor
includes a weighted routing stage in which said first variation values are weighted according
to predetermined weighting values and routed to inputs of a first neural layer, a selection stage
in which an output of the first neural layer is selected, and an output processing stage in which
a coding strategy is assigned to said first exponent set based on the output of said selection
stage and said second variation.

33. An audio encoder as claimed in claim 32, wherein the first variation processor is
arranged to determine said first variation according to

$$\mathrm{Adiff}(E_i, E_j) = \Big(\sum_m |e_{i,m} - e_{j,m}|\Big)/n$$
where Adiff is said first variation,

$E_i$ is said first exponent set and $E_j$ is a subsequent exponent set with j > i,

$E_i = (e_{i,0}, e_{i,1}, e_{i,2}, \ldots, e_{i,n-1})$,
$E_j = (e_{j,0}, e_{j,1}, e_{j,2}, \ldots, e_{j,n-1})$,

n is an integer representing the number of exponents in a said set of exponents,
m = 0, 1, 2, ..., n-1.

34. An audio coder as claimed in claim 29 or 33, wherein the weighted routing stage of
the neural network processor is arranged to determine

the operator $\Gamma[\cdot]$ is defined as:

where

$f(\gamma_i)$ is +1 if $\gamma \ge 0$, else it is 0,
Y represents outputs of the first neural layer,

T are threshold values determined during a training phase,

w are weighting values determined during the training phase, and

b is the number of sets in the sequence.

35. An audio encoder as claimed in claim 34, wherein the selection stage comprises
selecting an output $y_a$ of the first neural layer such that $y_a = 1$ and a is maximum for $i < a < b$.

36. A method as claimed in claim 35, wherein the plurality of exponent coding strategies
comprises strategies $S_1, S_2, \ldots, S_c$, where $c \le b$, corresponding to respective differential
coding limits 1, 2, ..., c.

37. A method as claimed in claim 36, wherein the exponent coding strategy assigned
for encoding exponents in said first set $E_i$ is selected according to

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1]$$

where $\sigma(E_i) = \mathrm{floor}\big((\sum_j |e_{i,j+1} - e_{i,j}|/n) + 0.5\big)$.

(57) Abstract

A method and apparatus for assigning an exponent coding strategy in a digital audio transform coder. Different coding strategies
having different differential coding limits may be assigned to different sets of transform exponents according to the frequency domain
characteristics of the audio signal. A neural network processing system is utilised to perform an efficient mapping of each exponent set to
an appropriate coding strategy.

A NEURAL NETWORK BASED METHOD FOR EXPONENT CODING
IN A TRANSFORM CODER FOR HIGH QUALITY AUDIO 

Technical Field 

This invention is applicable in the field of audio coders which employ exponent coding
techniques to provide high levels of digital data compression.

Background Art 

In order to more efficiently broadcast or record audio signals, the amount of information
required to represent the audio signals may be reduced. In the case of digital audio
signals, the amount of digital information needed to accurately reproduce the original
pulse code modulation (PCM) samples may be reduced by applying a digital compression
algorithm, resulting in a digitally compressed representation of the original signal. The
goal of the digital compression algorithm is to produce a digital representation of an audio
signal which, when decoded and reproduced, sounds the same as the original signal, while
using a minimum of digital information for the compressed or encoded representation.

Recent advances in audio coding technology have led to high compression ratios while
keeping audible degradation in the compressed signal to a minimum. These coders are
intended for a variety of applications, including 5.1 channel film soundtracks, HDTV,
laser discs and multimedia. Description of one applicable method can be found in the
Advanced Television Systems Committee (ATSC) Standard document entitled "Digital
Audio Compression (AC-3) Standard", Document A/52, 20 December, 1995.

In the basic approach, at the encoder the time domain audio signal is first converted to the
frequency domain using a bank of filters. The frequency domain coefficients, thus
generated, are converted to fixed point representation. In fixed point syntax, each
coefficient is represented as a mantissa and an exponent. The bulk of the compressed
bitstream transmitted to the decoder comprises these exponents and mantissas.

The exponents are usually transmitted in their original form. However, each mantissa
must be truncated to a fixed or variable number of decimal places. The number of bits to
be used for coding each mantissa is obtained from a bit allocation algorithm which may be
based on the masking property of the human auditory system. Lower numbers of bits
result in higher compression ratios because less space is required to transmit the
coefficients. However, this may cause high quantization errors, leading to audible
distortion. A good distribution of available bits to each mantissa forms the core of the
advanced audio coders.

Further compression is possible by employing differential coding for the exponents. In
this case, the exponents for a channel are differentially coded across the frequency range.
That is, instead of sending actual values, for many exponents only the difference between
adjacent exponent values is sent. Furthermore, instead of coding every exponent, a single
exponent value may be sent for every two exponents. In the extreme case, when exponent sets
of several consecutive blocks in a frame are almost identical, only the exponent set for the
first block is sent, and the remaining blocks in the frame reuse the previously sent values.

These different methods amount to different exponent coding schemes presently in use,
and are commonly referred to as exponent strategies.

The differential exponent coding schemes presently in use limit the difference between
consecutive exponents to a fixed value which is predefined by the standard. This does not
take into consideration the signal characteristic. As a result, sometimes for a fast varying
audio spectrum the limit is too low, which results in information loss due to truncation.
At other times the limiting value may be too large for a slow varying spectrum, thereby
resulting in bit wastage.

A good set of exponent strategies is therefore desirable, together with an efficient
processing engine which determines which exponent strategy is most suited for a particular
set of exponents. The processing engine should not be too computationally intensive, so as to
allow encoders to operate in real time and run on systems having relatively small
computational resources.

Summary of the Invention 

In accordance with the present invention, there is provided a method for coding audio data
comprising a sequence of exponent sets each comprising a plurality of exponents,
comprising the steps of:

determining a first variation of exponent values between a first exponent set in the
sequence and each subsequent exponent set in said sequence;

selecting an exponent coding strategy for said first exponent set from a plurality of
exponent coding strategies on the basis of said first variation; and
coding said first exponent set according to the selected exponent coding strategy.

Preferably each of the plurality of exponent coding strategies corresponds to a different 
differential coding limit. 

In one form of the invention, the method includes selecting one of said subsequent
exponent sets on the basis of said first variation and assigning an exponent re-use coding
strategy to the selected exponent set and any exponent sets in said sequence between the
first exponent set and the selected exponent set.

Preferably the method includes a step of determining a second variation between
consecutive exponents in said first exponent set, wherein the exponent coding strategy for
said first exponent set is selected on the basis of said first and second variations.

The step of selecting the exponent coding strategy for the first exponent set is preferably
performed using neural network processing. In that case, the neural network processing
may include a feature extraction stage in which said sequence of exponent sets is processed
to determine said first variation values, a weighted routing stage in which said first
variation values are weighted according to predetermined weighting values and routed to
inputs of a first neural layer, a selection stage in which an output of the first neural layer
is selected, and an output processing stage in which a coding strategy is assigned to said
first exponent set based on the output of said selection stage and said second variation.

Preferably the feature extraction stage comprises determining the first variation values
according to

$$\mathrm{Adiff}(E_i, E_j) = \Big(\sum_m |e_{i,m} - e_{j,m}|\Big)/n$$

where Adiff is said first variation,

$E_i$ is said first exponent set and $E_j$ is a subsequent exponent set with j > i,

$E_i = (e_{i,0}, e_{i,1}, e_{i,2}, \ldots, e_{i,n-1})$,
$E_j = (e_{j,0}, e_{j,1}, e_{j,2}, \ldots, e_{j,n-1})$,

n is an integer representing the number of exponents in a said set of exponents,

m = 0, 1, 2, ..., n-1.
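
As an illustration only, the feature extraction formula above can be sketched in Python. The function names `adiff` and `first_variations` are hypothetical and do not appear in the application; exponent sets are assumed to be plain integer sequences of equal length n.

```python
from typing import List, Sequence


def adiff(e_i: Sequence[int], e_j: Sequence[int]) -> float:
    """Mean absolute difference between two exponent sets of equal length n."""
    if len(e_i) != len(e_j) or len(e_i) == 0:
        raise ValueError("exponent sets must be non-empty and of equal length")
    n = len(e_i)
    # Sum |e_{i,m} - e_{j,m}| over m = 0 .. n-1, then divide by n.
    return sum(abs(a - b) for a, b in zip(e_i, e_j)) / n


def first_variations(sets: Sequence[Sequence[int]], i: int) -> List[float]:
    """Adiff(E_i, E_j) for every subsequent exponent set j > i."""
    return [adiff(sets[i], sets[j]) for j in range(i + 1, len(sets))]
```

A small value indicates that a later block's exponents track those of the first block closely, which is the situation in which re-use becomes attractive.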

Preferably the processing carried out by the weighted routing stage and first neural layer
includes determining

the operator $\Gamma[\cdot]$ is defined as:

where $f(\gamma_i)$ is +1 if $\gamma \ge 0$, else it is 0,

Y represents outputs of the first neural layer,

T are threshold values determined during a training phase,
w are weighting values determined during the training phase, and
b is the number of exponent sets in the sequence comprising said data.
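
The equations for this stage are not reproduced in the present text, so the following Python sketch rests on stated assumptions rather than on the application itself: each variation value is assumed to be scaled by its own weight and compared against its own threshold, with the sign chosen so that a small variation yields an output of 1. Only the hard-limiting function f (+1 for a non-negative argument, otherwise 0) is taken from the definitions above. The names `step` and `first_layer` are hypothetical.

```python
from typing import List, Sequence


def step(gamma: float) -> int:
    """Hard limiter f: +1 if the argument is non-negative, else 0."""
    return 1 if gamma >= 0 else 0


def first_layer(adiffs: Sequence[float],
                weights: Sequence[float],
                thresholds: Sequence[float]) -> List[int]:
    """Assumed form of the first neural layer: y = f(T - w * Adiff) per input.

    The element-wise form and the sign convention are assumptions made for
    this sketch; in the application the weights and thresholds come from a
    training phase that is not modelled here.
    """
    if not (len(adiffs) == len(weights) == len(thresholds)):
        raise ValueError("inputs, weights and thresholds must have equal length")
    return [step(t - w * d) for d, w, t in zip(adiffs, weights, thresholds)]
```

With unit weights and thresholds of 0.5, variations of 0.1 and 0.2 would produce outputs of 1, while a variation of 1.5 would produce 0.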

In one form of the invention, the selection stage comprises selecting an output $y_a$ of the
first neural layer such that $y_a = 1$ and a is maximum for $i < a < b$. The plurality of
exponent coding strategies may comprise strategies $S_1, S_2, \ldots, S_c$, where $c \le b$,
corresponding to respective differential coding limits 1, 2, ..., c, wherein the exponent
coding strategy assigned to said first exponent set is selected according to

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1]$$
where $\sigma(E_i) = \mathrm{floor}\big((\sum_j |e_{i,j+1} - e_{i,j}|/n) + 0.5\big)$.

The exponent sets in the sequence between i+1 and a, inclusive, can then be assigned the
exponent re-use coding strategy.
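
A minimal Python sketch of the selection and output processing stages, under stated assumptions: the list `y` is taken to be indexed by absolute block index, the sum in the second variation is taken over j = 0 .. n-2, and when no output equals 1 the sketch falls back to a = i (no re-use), a case the text does not address. All function names are hypothetical.

```python
import math
from typing import List, Optional, Sequence, Tuple


def sigma(e_i: Sequence[int]) -> int:
    """Second variation: rounded mean absolute difference of consecutive exponents."""
    n = len(e_i)
    total = sum(abs(e_i[j + 1] - e_i[j]) for j in range(n - 1))
    return math.floor(total / n + 0.5)


def select_last_reusable(y: Sequence[int], i: int) -> Optional[int]:
    """Largest a with i < a < b and y[a] == 1, or None if there is none."""
    for a in range(len(y) - 1, i, -1):
        if y[a] == 1:
            return a
    return None


def assign_strategy(e_i: Sequence[int], y: Sequence[int], i: int) -> Tuple[int, List[int]]:
    """Return (gamma, indices of the sets given the re-use strategy)."""
    a = select_last_reusable(y, i)
    if a is None:
        a = i  # assumption: no later block qualifies, so nothing is re-used
    gamma = max(min(a + 1, sigma(e_i)), 1)
    return gamma, list(range(i + 1, a + 1))
```

For example, with `e_i = [0, 5, 0, 5]`, `y = [0, 1, 1, 0]` and `i = 0`, the sketch selects a = 2, computes a second variation of 4, assigns strategy index 3 to the first set and marks sets 1 and 2 for re-use.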

The present invention also provides a method for processing data in an audio data encoder,
the data comprising a sequence of exponent sets each comprising a plurality of exponents,
comprising the steps of:

determining a first variation of exponent values within a first exponent set;

determining a second variation of exponent values between said first exponent set
and each subsequent exponent set in said sequence; and
assigning an exponent coding strategy to the first exponent set based on the
determined first and second variations.

The invention also provides a digital audio encoder in which audio data is transformed into
coefficients having mantissas and exponents arranged in a sequence of sets, having:
a first variation processor coupled to receive the exponents of sets from said
sequence and determine a first variation of exponent values between a first set and a
plurality of subsequent sets in the sequence;

a second variation processor coupled to receive the exponents of said first set and
determine a second variation between consecutive exponent values within said first set;
and

a neural network processor coupled to receive said first and second variations and
select and assign an exponent coding strategy to said first set from a plurality of coding
strategies on the basis of said first and second variations.


Brief Description of the Drawings 

The invention is described in greater detail hereinafter, by way of example only, with
reference to several embodiments, and having regard to the accompanying drawings,
wherein:

Figure 1 is a diagrammatic illustration of the data structure of an encoded audio
data bit stream, having frames and blocks;

Figure 2 is a functional block diagram of an audio encoder;

Figure 3 is a diagrammatic illustration of mapping of exponent sets to coding
strategies; and

Figure 4 is a diagrammatic illustration of a neural network system for selection of
exponent coding strategies.






Detailed Description of the Preferred Embodiments

Suppose a frame comprises b blocks and a block $B_i$ is composed of an ordered exponent
set $E_i = (e_{i,0}, e_{i,1}, \ldots, e_{i,n-1})$ and mantissa set $(m_{i,0}, m_{i,1}, m_{i,2}, \ldots, m_{i,n-1})$. If the
elements in $E_i$ are very similar to those in $E_{i+1}, E_{i+2}, \ldots, E_{i+a}$, then only exponent set $E_i$
needs to be coded into the bitstream. Since the final question is more a matter of
perception (the coding must not cause any audible degradation) rather than an exact
closed form mathematical equation, a stochastic approach is suitable. Neural networks
are known to solve such problems efficiently with minimum computation requirements.

The maximum difference value allowed between exponents should be based on the 
10 variation in the exponents and the number of blocks over which the exponent set is to be 
reused. Allowing a higher value of maximum allowed difference results in less coding 
error. If an exponent value is to be reused several times it is necessary to code it with 
minimal error or else the accumulated error over several blocks could be quite 
appreciable. 

15 

Consider a set of exponent coding strategies {So, Si, S^, . }. Let strategy Sq be defined 
as yeus€\ That is, if exponent setE;, = (Ci^^ e.- j, e,-^, e,-.^/>) uses strategy Sq for 
coding then essentially no exponent is transmitted. Instead the receiver is directed to use 
exponents of Ej., as those of exponent E^. Next, let Sj be defined as an exponent coding 
20 strategy where all e:q)onents are differentially encoded with the constraint that the 
difference between any two consecutive exponents is always -1. +1 or 0. That is, for 
exponent coding strategy Sp the maximum allowed difference, L, equals +/-1. 
Similarly, for strategy S2 , let die maximum allowed difference, L, be -1-/-2, and so on for 
S3, S4, . - - . 

25 

The inputs (Eq, Ei, E^, E^^,) are presented to the neural network system. The output 
(Oo, o,, o^, . . . , o^j) of the system is the exponent strategy corresponding to each exponent 
set. 
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The input to an audio coder comprises a stream of digitised samples of the time domain 
analog signal, which is illustrated diagrammatically in Figure 1. The bit stream comprises 
a sequence of frames, with each frame consisting of b blocks identified as (Bq, B,, B-,, 
B,^i) . Each block contains n consecutive samples of data. A frame is an independent 
5 entity, such that there is no inter-frame data sharing. This facilitates splicing of encoded 
data at the frame level, and rapid recovery from transmission error. By grouping blocks 
to form a frame coding efficiency is increased since redundant data is shared, as permitted 
by signal conditions, across the frame. The block size in a particular coding standard is 
usually determined based on spectral and temporal resolution requirements for the audio 
10 signal. 

Figure 2 illustrates a general block diagram of an audio encoder 50. Within any block $B_i$, 
the $n$ samples form an ordered set $(s_{i,0}, s_{i,1}, s_{i,2}, \ldots, s_{i,n-1})$. These time domain samples 
are converted to the frequency domain using an analysis filter bank 
52. The frequency domain coefficients, thus generated, form another ordered set which 
can be identified as $(c_{i,0}, c_{i,1}, c_{i,2}, \ldots, c_{i,n-1})$. Here $c_{i,0}$ is the lowest frequency (dc) 
component while $c_{i,n-1}$ is the highest frequency component of the signal. 

Audio compression essentially entails finding how much of the information in the set 
$(c_{i,0}, c_{i,1}, c_{i,2}, \ldots, c_{i,n-1})$ is necessary to reproduce the original analog signal at the decoder with 
minimal audible distortion. 

The coefficient set $(c_{i,0}, c_{i,1}, c_{i,2}, \ldots, c_{i,n-1})$ is next converted into floating point format, 
where each coefficient $c_{i,j}$ is represented by an exponent $e_{i,j}$ and mantissa $m_{i,j}$. By 
definition, $e_{i,j} = \mathrm{floor}(\log_2(c_{i,j}))$. Also, mantissa $m_{i,j} = c_{i,j} / 2^{e_{i,j}}$. The exponents and 
mantissas form two ordered sets $E_i = (e_{i,0}, e_{i,1}, \ldots, e_{i,n-1})$ and $M_i = (m_{i,0}, m_{i,1}, \ldots, 
m_{i,n-1})$. Therefore, each coefficient $c_{i,j}$ is related to the exponent $e_{i,j}$ and mantissa $m_{i,j}$ by the 
relation $c_{i,j} = m_{i,j} * 2^{e_{i,j}}$. 

The exponent set is usually transmitted in its original form. However, the mantissa is 
truncated to a fixed or variable number of decimal places. Suppose mantissa $m_{i,j}$ is 
transmitted as $b_{i,j}$ number of bits. The value $b_{i,j}$ is usually obtained from a bit allocation 
algorithm which for advanced psychoacoustic coders may be based on the masking 
property of the human auditory system. A low value of $b_{i,j}$ results in a higher compression 
ratio because less space is required to transmit the coefficients. However, this causes very 
high quantization error leading to audible distortion. A good distribution of available bits 
to each mantissa forms the core of the most advanced encoders. 
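
The exponent and mantissa split described above can be illustrated with a minimal Python sketch. It is not part of the specification: the function name `split_coefficient`, the use of the absolute value for negative coefficients and the treatment of a zero coefficient are all assumptions made for illustration.

```python
import math

def split_coefficient(c: float) -> tuple[int, float]:
    """Return (exponent, mantissa) such that c == mantissa * 2**exponent."""
    if c == 0.0:
        # log2(0) is undefined; representing zero as (0, 0.0) is an assumption.
        return 0, 0.0
    # The text defines e = floor(log2(c)); taking abs() for negative
    # coefficients is an assumption of this sketch.
    exponent = math.floor(math.log2(abs(c)))
    mantissa = c / 2.0 ** exponent
    return exponent, mantissa

coefficients = [6.0, 0.3, -0.75]
pairs = [split_coefficient(c) for c in coefficients]
for c, (e, m) in zip(coefficients, pairs):
    print(f"c = {c}: exponent = {e}, mantissa = {m}")
```

With this definition the magnitude of a non-zero mantissa lies in the range 1 to 2, so the exponent alone carries the coarse spectral envelope.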

Further compression is made possible in audio coders by application of certain coding 
techniques for exponents. Since most music is characterised by slowly changing spectrum 
across audio blocks, it is possible to reuse the exponents of a block $B_i$ as exponents of the 
next few following blocks within the frame. That is, if the elements in exponent set $E_i$ are 
very similar to the corresponding elements in sets $E_{i+1}, E_{i+2}, \ldots, E_{i+a}$ then only exponent 
set $E_i$ needs to be coded into the bitstream. A bit can be set to indicate to the decoder to 
reuse $E_i$ for the next $a$ blocks in the frame. The similarity of set $E_i$ to another set $E_j$ is, 
in general, a difficult problem to solve. 

In addition to reuse, the exponents for a channel are usually differentially coded across the 
frequency range. That is, instead of sending the actual values, only the differences 
between consecutive exponents are sent. However, as mentioned, the exponent coding schemes 
limit the allowable difference between adjacent exponents in the sequence to a fixed value, 
called $L$, which is predefined by the standard. This does not take into consideration the 
actual signal characteristics. As a result, sometimes for a fast varying spectrum the limit is 
too low, which results in information loss due to truncation. At other times the limiting 
value may be too large for a slow varying spectrum, thereby resulting in bit wastage. 
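
The effect of a fixed limit $L$ can be seen in a small Python sketch. It is illustrative only: the function names are hypothetical, and clamping each difference against the decoder's running value is an assumption. A real coder, such as the one described in D1, may condition the exponents differently before coding them.

```python
def encode_differential(exponents: list[int], limit: int) -> tuple[int, list[int]]:
    """Return the first exponent and the differences that follow, clamped to [-limit, +limit]."""
    first = exponents[0]
    deltas = []
    previous = first  # value the decoder will have reconstructed so far
    for target in exponents[1:]:
        delta = max(-limit, min(limit, target - previous))  # truncation to +/-L
        deltas.append(delta)
        previous += delta
    return first, deltas

def decode_differential(first: int, deltas: list[int]) -> list[int]:
    """Rebuild the exponent sequence from the first value and the differences."""
    decoded = [first]
    for delta in deltas:
        decoded.append(decoded[-1] + delta)
    return decoded

fast = [5, 9, 8, 3]  # a fast varying spectrum
first, deltas = encode_differential(fast, limit=2)
decoded = decode_differential(first, deltas)
print(deltas, decoded)  # the jumps of +4 and -5 are truncated when L = 2
```

With `limit=5` the same sequence is reproduced exactly, at the cost of a wider code for every difference, which is the trade-off the paragraph above describes.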

Instead of allowing variation in value of $L$, some existing coders modify the frequency 
resolution in order to provide varying levels of compression for exponents. For example, 
instead of sending delta values for every element in set $(e_{i,0}, e_{i,1}, e_{i,2}, e_{i,3}, \ldots, e_{i,n-1})$, 
only delta for every 2 (or 3 or 4 etc.) frequency coefficients are sent. That is, the 
transmitted set is $(e_{i,0}, e_{i,2}, e_{i,4}, \ldots, e_{i,n-1})$. At the receiver the values for $e_{i,1}, e_{i,3}, e_{i,5}, \ldots$ are 
obtained by interpolation of the received values. However, this scheme is essentially a 
subset of the former scheme of varying the value of $L$. 

To pose the problem more formally, suppose a set of exponent coding strategies is defined 
as $\{S_0, S_1, S_2, \ldots\}$. Let strategy $S_0$ be defined as "reuse". That is, if exponent set $E_i$ 
uses strategy $S_0$ for coding then essentially no exponent is transmitted but instead the 
receiver is directed to use the exponents of $E_{i-1}$ as those of exponent set $E_i$. Next, let $S_1$ be the 
exponent coding strategy where all exponents are differentially encoded with the constraint 
that the difference between any two consecutive exponents is always -1, +1 or 0. That is, 
for exponent coding strategy $S_1$ the maximum allowed difference, $L$, equals +/-1. 
Similarly, for strategy $S_2$, let the maximum allowed difference, $L$, be +/-2, and so on for 
$S_3, S_4, \ldots$. 

Based on the above discussion, the problem of exponent strategy assignment can be 
formally stated as a mapping of each exponent set $E_i$ within a frame to a suitable 
strategy $S_j$ such that maximum compression is achieved with minimal audible distortion. 
A mapping of exponent sets to coding strategies is illustrated diagrammatically in Figure 
3. Since each frame is an independent entity, exponent set $E_0$ is not allowed to use 
strategy $S_0$ or else it would attempt to use the exponent information in the previous frame. 

For example, suppose a frame consists of six blocks with exponent sets $E_0, E_1, E_2, E_3, E_4$ 
and $E_5$, and the assigned strategy is $S_5, S_0, S_0, S_0, S_0$ and $S_1$, respectively. This means 
exponents in set $E_0$ are differentially encoded with the constraint that the maximum 
allowed difference is +/-5 (by previous definition of strategy $S_5$). For blocks 1, 2, 3 and 
4, no exponent value is sent as they are instructed to reuse the exponents from the 
previously transmitted set. Block $B_5$'s exponents, that is $E_5$, are transmitted as 
differentially encoded values with the constraint that the maximum allowed difference is 
+/-1 (definition of strategy $S_1$). 
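
The six-block example can be traced with a short Python sketch. It is only an illustration of how a per-frame strategy list might be interpreted: strategies are represented by their index (0 for reuse, $k$ for a differential limit of +/-$k$), and the function name `describe_frame` is hypothetical.

```python
def describe_frame(strategies: list[int]) -> list[str]:
    """Explain, for each block, whether its exponent set is transmitted or reused."""
    if not strategies or strategies[0] == 0:
        # A frame is independent, so its first exponent set cannot reuse anything.
        raise ValueError("the first exponent set of a frame cannot use S0 (reuse)")
    description = []
    source = 0  # index of the most recently transmitted exponent set
    for block, strategy in enumerate(strategies):
        if strategy == 0:
            description.append(f"E{block}: reuse E{source}")
        else:
            source = block
            description.append(f"E{block}: differential, L = +/-{strategy}")
    return description

plan = describe_frame([5, 0, 0, 0, 0, 1])
for line in plan:
    print(line)
```

For the example frame only $E_0$ and $E_5$ are transmitted; blocks 1 to 4 carry no exponent data at all.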

The optimal mapping is virtually impossible to find due to the complex behaviour of the 
system, large numbers of competing parameters, and the fact that optimality depends on 
perceived quality of reproduced music rather than on a closed form equation with a 
minimal value. Moreover, there is a long-standing debate on trade-offs between quality of 
final music, compression ratio achieved and computational requirements of the system. 
Therefore, having formally outlined the problem space, a stochastic approach is used to 
provide the solution. A Neural Network based solution is described herein. 

Neural Network System 

A neural network's ability to perform computation is based on the effort to reproduce the 
flexibility and power of the human brain by artificial means. Neural computation is 
performed by a dense mesh of computing nodes and connections. They operate 
collectively and simultaneously on most or all data inputs. The basic processing elements 
of the neural network are called nodes, or simply neurons. They can be considered as 
threshold units that fire when their total weighted input exceeds certain bias levels. 
Neurons are often organised in several layers, with neurons in consecutive layers often 
being connected together. Each connection strength is expressed by a numerical value 
called weight, the value of which is determined by the specific system which the network 
attempts to model. 

Figure 4 is a diagrammatic illustration of a neural network system for selection of 
exponent coding strategies. The inputs to the neural network system are the exponent sets 
$(E_0, E_1, E_2, \ldots, E_{b-1})$, where $E_i = (e_{i,0}, e_{i,1}, e_{i,2}, \ldots, e_{i,n-1})$. The output $(o_0, o_1, \ldots, o_{b-1})$ 
is the exponent strategy corresponding to each exponent set. Starting from $E_0$ the strategy 
for each exponent set is determined. Suppose the exponent strategy for $E_i$ is to be 
determined. The exponent coding strategy for $E_i$ depends on: 

1. The number of times $E_i$ will be reused across the blocks in the frame. If an 
exponent value is to be reused several times it is necessary to code it with minimal 
error or else the accumulated error over several blocks could be quite appreciable. 

2. The variation between consecutive elements in the set $E_i$. For a spectrum 
with large variations across the frequency range the limit $L$ must be high to 
minimise information loss due to truncation. For a spectrum with low variation a 
low value of $L$ is adequate and this results in better compression ratio. 

A two stage network is used to take both these aspects into consideration when deciding on 
the strategy for an exponent set. The first criterion is the number of blocks over which 
an exponent set is reused. This is dealt with primarily by the first layer. If the elements 
in $E_i$ are very similar to those in $E_{i+1}, E_{i+2}, \ldots, E_{i+a}$ then only exponent set $E_i$ needs to be 
coded into the bitstream. A flag can be set to indicate to the decoder to reuse $E_i$ for the 
next $a$ blocks in the frame. 

Feature Extraction 

To find the number of blocks over which the same exponent set can be safely used without 
audible distortion in signal, a measuring function for similarity is required. The 
correlation coefficient: 

$$r(E_i, E_j) = \frac{\sum_m (e_{i,m} - \bar{e}_i)(e_{j,m} - \bar{e}_j)}{\sqrt{\sum_m (e_{i,m} - \bar{e}_i)^2 \, \sum_m (e_{j,m} - \bar{e}_j)^2}} \qquad \text{(Eq. 1)}$$

is a mathematically robust method for computing similarity, but it is too computationally 
intensive for a practical system. Let $Adiff(E_i, E_j)$ be a function that determines the 
similarity between the elements of the sets $E_i$ and $E_j$ and returns a value which shows the 
degree of difference. The similarity function $Adiff$ (mean absolute difference) is defined 
as: 

$$Adiff(E_i, E_j) = \Big(\sum_m |e_{i,m} - e_{j,m}|\Big)/n \qquad \text{(Eq. 2)}$$

The values $Adiff(E_i, E_{i+1}), Adiff(E_i, E_{i+2}), \ldots$ form the inputs $z_1, z_2, \ldots$, 
respectively, to the first stage of neurons. Input $z_0$ is always set to 0, as discussed 
hereinbelow. Since this step extracts only that characteristic of the input which is 
instrumental in the decision process of the network, it is termed feature extraction. 

Reuse Strategy 

The first stage of the network determines the maximum number of exponent sets that are 
considerably similar to a given set $E_i$. The topology of the network and connection 
weights are such that the neuron $y_1$ fires (corresponding output is 1) if $z_1$ is within 
acceptable limit. Similarly, $y_2$ fires if both $z_1$ and $z_2$ are within acceptable limits. Since 
$z_1 = Adiff(E_i, E_{i+1})$, it implies that if $y_1$ fires then $E_i$ and $E_{i+1}$ are very similar. Similarly, if 
$y_2$ fires it implies $E_{i+1}$ and $E_{i+2}$ are both very similar to $E_i$. As $z_0$ and the threshold value 
for $y_0$ are both zero, $y_0$ always fires. 

Each neuron in the first layer therefore computes the weighted sum of its inputs. If the 
weighted sum is less than or equal to the threshold value, the neuron fires thus making the 
output 1. If the weighted sum of inputs is larger than the threshold value it means that not 
all inputs to this neuron are similar to $E_i$ and thus the neuron does not fire. Using matrix 
notation the output of the first stage of the network is: 

$$Y = \Gamma[\,WZ - T\,] \qquad \text{(Eq. 3)}$$

where $Y = (y_0, y_1, \ldots, y_{b-1})^T$, $Z = (z_0, z_1, \ldots, z_{b-1})^T$, $T = (T_0, T_1, \ldots, T_{b-1})^T$ and $W$ is the 
matrix of weights $w_{i,j}$. The operator $\Gamma[\cdot]$ is defined as: 

$$\Gamma\begin{bmatrix} \gamma_0 \\ \gamma_1 \\ \gamma_2 \\ \vdots \\ \gamma_{b-1} \end{bmatrix} = \begin{bmatrix} f(\gamma_0) \\ f(\gamma_1) \\ f(\gamma_2) \\ \vdots \\ f(\gamma_{b-1}) \end{bmatrix} \qquad \text{(Eq. 4)}$$

where $f(\gamma_i)$ is +1 if $\gamma_i \le 0$ else it is 0. The weight ($w_{i,j}$) and the threshold ($T_i$) values are 
determined from the training phase of the neural network. 



Perceptron learning rule is used for training the network. In this form of supervised 
training, the learning signal is a function of the difference between the desired and the 
actual neuron's response. Before the training begins, the weights are usually initialised to 
small random values (small values are necessary to avoid a biased initial state). An input 
vector is presented to the system and the output is computed. The output from the 
network is compared with the desired output. The desired output was determined by 
extensive simulation and listening experiments. If the actual and the desired values are 
different the weights need to be adjusted to condition the network to imitate the desired 
behaviour. The weights and the threshold are adjusted according to Eq. 5 and Eq. 6, 
respectively. Here $d_i$ is the desired output. The learning rate, $\alpha = 0.11$, has been 
experimentally found to be most suited for this application. 

$$\Delta T_i = \alpha\,\Big[d_i - f\Big(\sum_j (w_{i,j} * z_j) - T_i\Big)\Big] * 0.1 \qquad \text{(Eq. 6)}$$

In this system, the threshold requires a multiplication by 0.1 for maintaining the same rate of 
adjustments for both thresholds and weights. A good set of training vectors and well 
adjusted desired output are crucial to the correct setting of the weights. 

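An example of weights and thresholds of a trained system for a transform coder with six 
audio blocks is provided below: 

$$W = \begin{bmatrix} 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & .05 & 0 & 0 & 0 & 0 \\ 0 & .91 & .23 & 0 & 0 & 0 \\ 0 & 1.2 & .96 & .29 & 0 & 0 \\ 0 & .96 & .92 & .59 & .20 & 0 \\ 0 & 1.2 & 1.0 & .94 & .88 & .28 \end{bmatrix} \qquad T = \begin{bmatrix} 0 \\ .02 \\ .33 \\ .71 \\ .75 \\ 1.2 \end{bmatrix}$$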
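
The firing rule of the first layer (a neuron outputs 1 when the weighted sum of its inputs does not exceed its threshold) can be sketched in Python with the example values. The row-by-row reading of the weight and threshold table used here is an assumption, as are the function name `first_layer` and the input vector `z`.

```python
# Example weights and thresholds, read row by row (an assumed reading of the table).
W = [
    [0.0, 0.00, 0.00, 0.00, 0.00, 0.00],
    [0.0, 0.05, 0.00, 0.00, 0.00, 0.00],
    [0.0, 0.91, 0.23, 0.00, 0.00, 0.00],
    [0.0, 1.20, 0.96, 0.29, 0.00, 0.00],
    [0.0, 0.96, 0.92, 0.59, 0.20, 0.00],
    [0.0, 1.20, 1.00, 0.94, 0.88, 0.28],
]
T = [0.0, 0.02, 0.33, 0.71, 0.75, 1.2]

def first_layer(z: list[float], weights=W, thresholds=T) -> list[int]:
    """Return y, where y[k] = 1 if the weighted sum of z for neuron k is <= its threshold."""
    outputs = []
    for row, threshold in zip(weights, thresholds):
        weighted_sum = sum(w * x for w, x in zip(row, z))
        outputs.append(1 if weighted_sum <= threshold else 0)
    return outputs

# Hypothetical Adiff features: three similar sets followed by two dissimilar ones.
z = [0.0, 0.1, 0.2, 0.3, 3.0, 3.5]
y = first_layer(z)
print(y)
```

Neuron 0 has zero weights and a zero threshold, so it fires for any input, matching the statement that $y_0$ always fires.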



Analysing the weight matrix, an interesting behaviour becomes apparent. Along any 
row the weights decrease from left to right. The reason for this is as follows. Consider 
output 

$$y_2 = f(w_{2,1} * Adiff(E_i, E_{i+1}) + w_{2,2} * Adiff(E_i, E_{i+2}) - T_2) \qquad \text{(Eq. 7)}$$

If we consider the spectrum of a signal across time, the spectrum shape at time $t_{i+1}$ will be 
intermediate between spectrum shapes at time $t_i$ and $t_{i+2}$, assuming $\Delta t = t_{i+2} - t_{i+1} = t_{i+1} - 
t_i$ is small enough. Therefore, if $E_i$ reflects the spectral shape at time $t_i$ and similarly 
$E_{i+1}$ and $E_{i+2}$ the shape at $t_{i+1}$ and $t_{i+2}$, respectively, then since the spectrum is changing 
over time, in general $Adiff(E_i, E_{i+1}) \le Adiff(E_i, E_{i+2})$. In Eq. 7, to level this discrepancy 
the multiplicative weight for $Adiff(E_i, E_{i+1})$ is larger than that for $Adiff(E_i, E_{i+2})$. 

WO 99/41844    PCT/SG98/00009 

In effect, the similarity function for estimating similarity of two exponent sets, $Adiff(E_i, 
E_j)$, now has a multiplicative factor $d(i,j) = w_{i,j}$. The new similarity function 
$d(i,j) * Adiff(E_i, E_j)$ judges similarity of two sets based on not only the mean absolute 
difference between the elements, but also the time difference, expressed as difference in 
block number, that exists between the two exponent sets. 

Two exponent sets closer in time are expected to match much more than two sets with 
much greater time difference, in order for the similarity function to generate the same 
value. The fact that the network was able to automatically detect this peculiar 
characteristic shows its strength and our correctness in modelling of the system. It is to be 
noted that the system is extremely robust. In an existing system new weight vectors, 
obtained from perhaps superior training data and training rule, can always be updated to 
provide better results. 

Winner Takes All Strategy 

To the careful reader it is evident that at any one time it is possible that more than one 
neuron fires. The objective is to determine the maximum number of exponent sets which 
can be identified as similar to given set $E_i$. In the winner-takes-all strategy, starting from 
output $y_{b-1}$ and proceeding upward, each neuron is analysed to find the first one that has 
fired. Suppose $y_a$ is the first one from the bottom to have fired. Neuron $y_a$ is declared 
winner and its output is left as 1. All other neurons in this layer are now switched off. 
As a result the second layer's output is now entirely dependent on neuron $y_a$. 

From the network topology and the previous discussions, it is evident that $y_0$ always fires. 
This ensures that there is always at least one winner. Now, neuron $y_1$ fires if exponent 
sets $E_i$ and $E_{i+1}$ are found by the first layer to be very similar. Similarly, $y_2$ fires if 
exponent sets $E_{i+1}$ and $E_{i+2}$ are both very similar to $E_i$. 

In general, $y_a$ fires when $E_{i+1}, E_{i+2}, E_{i+3}, \ldots$ and $E_{i+a}$ are all very similar to $E_i$. 
Therefore, only exponent set $E_i$ is coded and transmitted into the bitstream. Exponent sets 
$E_{i+1}, E_{i+2}, E_{i+3}, \ldots, E_{i+a}$ are all forced to reuse the values in $E_i$ (that is, all are assigned 
strategy $S_0$). 
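
The winner-takes-all selection can be sketched in Python as a scan from the last neuron upward. The function names are hypothetical, and the first-layer outputs are passed in as a plain list of 0 and 1 values.

```python
def winner_takes_all(y: list[int]) -> int:
    """Return the index a of the last neuron that fired (scan from the bottom upward)."""
    for a in range(len(y) - 1, -1, -1):
        if y[a] == 1:
            return a
    # y[0] is expected to fire always, so this should not be reached.
    raise ValueError("no neuron fired")

def reuse_blocks(i: int, a: int) -> list[int]:
    """Indices of the exponent sets that are assigned the reuse strategy S0."""
    return list(range(i + 1, i + a + 1))

a = winner_takes_all([1, 1, 1, 1, 0, 0])
print(a, reuse_blocks(0, a))
```

Only the winning index matters to the later stage: the outputs of all other neurons are discarded, which corresponds to switching them off.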

The exact strategy for $E_i$ still needs to be determined from the available options $\{S_1, S_2, 
S_3, \ldots\}$. A simple technique would be to always use a fixed strategy for $E_i$ (e.g. $S_2$) to 
provide minimal complexity. 

A slight improvement on it would be to choose a strategy depending on the number of 
blocks over which the exponent set $E_i$ would be reused. Note that if $y_a$ is the winner then 
exactly $a+1$ blocks are using the same exponent sets. If an exponent value is to be reused 
several times it is meaningful to code it with minimal error or else the accumulated error 
over several blocks could be quite appreciable. Therefore one could choose strategy $S_{a+1}$, 
where $y_a$ is the winner neuron. Thus by this process, if the winner neuron is $y_a$, the 
strategy for $E_i, E_{i+1}, E_{i+2}, E_{i+3}, \ldots, E_{i+a}$ is $S_{a+1}, S_0, S_0, S_0, \ldots, S_0$, respectively. 

The problem with this improved version is that it does not take the variations of the 
elements within the exponent set into consideration. A much improved coding strategy for 
$E_i$ would take into consideration not only the number of reuse blocks but also the 
variation within $E_i$. 

Mean absolute deviation, defined as $Adev(E_i) = (\sum_j |e_{i,j} - \bar{e}_i|)/n$ where $\bar{e}_i = \sum_j e_{i,j}/n$, could be 
used to provide a measure of variation. However, because finally differential coding is to 
be done, it is much more meaningful to base the strategy on the mean absolute difference 
between consecutive exponents, rather than the deviation of exponents from the mean. 
Consequently, we define mean absolute deviation (between consecutive exponents) as: 

$$\sigma(E_i) = \mathrm{floor}\Big(\Big(\sum_j |e_{i,j+1} - e_{i,j}| / n\Big) + 0.5\Big) \qquad \text{(Eq. 8)}$$

Finally, the exponent coding strategy recommended for $E_i$ is $S_\gamma$, where $\gamma$ is defined by 
Eq. 9 below: 

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1] \qquad \text{(Eq. 9)}$$

By this definition, $\gamma$ is always greater than or equal to 1. This ensures that reuse is never 
suggested for $E_i$. If variation value $\sigma(E_i)$ is greater than the number of reuse blocks, the 
value $a+1$ dictates which strategy is finally chosen. This provides a safety measure by 
preventing explosion in bit usage for large varying signals with limited reuse. When 
variation is low, as guided by $\sigma(E_i)$, bit usage is optimised by using only as many bits for 
the differential limit as required and not being overwhelmed by the large number of reuse 
blocks. 

At the end of this process the strategy for $E_i, E_{i+1}, E_{i+2}, \ldots, E_{i+a}$ is $S_\gamma, S_0, S_0, \ldots, 
S_0$, respectively. Now the strategy for $E_{i+a+1}, E_{i+a+2}, E_{i+a+3}, \ldots, E_{b-1}$ needs to be 
determined. Treating $E_{i+a+1}$ as $E_i$ the whole process is repeated. 

The deviation values $(\sum_j |e_{i,j+1} - e_{i,j}|/n)$ for some benchmark music streams are given in the 
tables below. From the statistics it is evident that there is enough variation in frequency 
domain for commonly occurring music streams to warrant a flexible limit for $L$ in a 
differential coding scheme. Moreover, from the range of values, it can be concluded that 
absolute deviation can be effectively used to determine the maximum differential limit, 
and hence the exponent strategy. 
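
Eq. 8 and Eq. 9 can be worked through with a short Python sketch. The function names and the sample exponent values are assumptions; the sum runs over the $n-1$ consecutive pairs and is divided by $n$, following Eq. 8 as written.

```python
import math

def sigma(exponents: list[int]) -> int:
    """Eq. 8: rounded mean absolute difference between consecutive exponents."""
    n = len(exponents)
    total = sum(abs(exponents[j + 1] - exponents[j]) for j in range(n - 1))
    return math.floor(total / n + 0.5)

def select_strategy(exponents: list[int], a: int) -> int:
    """Eq. 9: index gamma of the strategy for E_i, given the winner index a."""
    return max(min(a + 1, sigma(exponents)), 1)

varying = [4, 8, 3, 7, 2, 9]   # made-up set with large variation
flat = [5, 5, 5, 5]            # made-up set with no variation

print(sigma(varying))               # consecutive differences 4, 5, 4, 5, 7
print(select_strategy(varying, 2))  # limited by a + 1
print(select_strategy(varying, 5))  # limited by sigma
print(select_strategy(flat, 3))     # never below 1, so reuse is never chosen
```

The three calls show the three regimes of Eq. 9: the reuse count capping the limit, the variation capping the limit, and the floor of 1 that keeps $S_0$ from being assigned to $E_i$.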
[Tables: Adev and Reuse values for benchmark music streams, including castanets and harp. 
Recoverable entries: Adev 4.2, 3.13, 3.03, 2.6, 2.5, 2.7; Reuse 1.] 

Note the third and fourth column in the table for castanets. The exponents are reused 
across three blocks in both cases but the variations are radically different. In the first case 
a differential limit of 4 is necessary while in the second a limit of 3 should be adequate. 
This shows that improvement in quality and coding efficiency can be achieved using the 
technique proposed above. 

In summary of the above, referring again to the audio encoder shown in Figure 2, the 
frequency domain coefficients which are obtained from the analysis filter bank 52 are split 
into their constituent mantissas and exponents, and the exponents extracted (54) and passed 
to an exponent strategy processor 56. The exponent strategy processor 56 embodies the 
neural network system described hereinabove and illustrated diagrammatically in Figure 4. 
The exponent strategy processor 56 receives the extracted exponents and for each 
operation of the neural network system determines an appropriate coding strategy for at 
least one of the sets of exponents received. Each subsequent operation of the exponent 
strategy processor 56 then begins with the next exponent set not yet allocated a strategy. 
The exponents and assigned strategies are then passed to an exponent coder 58 which 
differentially codes the exponent sets according to the assigned strategies. The coded 
exponents are passed to a quantizer 60 where they are quantized together with the 
corresponding mantissas for packing into the output bitstream. Where an exponent set has 
been allocated a re-use strategy, quantization is not required, and accordingly a "re-use 
flag" is passed directly from the exponent coder 58 to the bitstream packer 62. 

The foregoing detailed description of the invention has been presented by way of example 
only, and is not intended to be considered limiting to the invention as defined in the claims 
appended hereto. 

Claims: 

1. A method for processing data in an audio data encoder, the data comprising a 
sequence of exponent sets each comprising a plurality of exponents, comprising the steps 
of: 

determining a first variation of exponent values within a first exponent set; 

determining a second variation of exponent values between said first exponent set 
and each subsequent exponent set in said sequence; and 

assigning an exponent coding strategy to the first exponent set based on the 
determined first and second variations. 

2. A method as claimed in claim 1, wherein the exponent coding strategy is assigned 
from a plurality of exponent coding strategies having different differential coding limits. 

3. A method as claimed in claim 2, including a step of coding said first exponent set 
according to the assigned exponent coding strategy. 

4. A method as claimed in claim 3, including a step of assigning an exponent coding 
strategy to at least one subsequent exponent set based on the corresponding determined 
second variation. 

5. A method as claimed in claim 4, wherein the plurality of exponent coding 
strategies includes an exponent set re-use strategy which is assigned to the at least one 
subsequent exponent set. 

6. A method as claimed in claim 5, including a step of coding said first exponent set 
and said at least one subsequent exponent set according to the corresponding assigned 
coding strategies. 

7. A method as claimed in claim 2 or 6, wherein the steps of determining the first and 
second variations are performed utilising neural network processing. 

8. A method as claimed in claim 7, wherein the neural network processing includes 
first and second neural layers. 

9. A method as claimed in claim 7, wherein the neural network processing comprises 
a feature extraction stage in which said sequence of exponent sets is utilised to determine 
said second variations, a weighted routing stage in which said second variations are 
weighted according to predetermined weighting values and routed to inputs of a first 
neural layer, a selection stage in which an output of the first neural layer is selected, and 
an output processing stage in which a coding strategy is assigned to said first exponent set 
based on said first variation and the output of said selection stage. 

10. A method as claimed in claim 9, wherein a coding strategy is assigned to at least 
one subsequent exponent set on the basis of the output of said selection stage. 

11. A method as claimed in claim 10, wherein the coding strategy assigned to the at 
least one subsequent exponent set is an exponent re-use strategy. 

12. A method as claimed in claim 9, wherein the feature extraction stage comprises 
determining 

$$Adiff(E_i, E_j) = \Big(\sum_m |e_{i,m} - e_{j,m}|\Big)/n$$

where Adiff is said second variation, 

$E_i$ is said first exponent set and $E_j$ is a subsequent exponent set with $j > i$, 

$E_i = (e_{i,0}, e_{i,1}, e_{i,2}, \ldots, e_{i,n-1})$, 

$E_j = (e_{j,0}, e_{j,1}, e_{j,2}, \ldots, e_{j,n-1})$, 

$n$ is an integer representing the number of exponents in a said set of 
exponents, 

$m = 0, 1, 2, \ldots, n-1$. 

13. A method as claimed in claim 12, wherein the processing carried out by the 
weighted routing stage and first neural layer includes determining 

the operator $\Gamma[\cdot]$ is defined as: 

$$\Gamma\begin{bmatrix} \gamma_0 \\ \gamma_1 \\ \gamma_2 \\ \vdots \\ \gamma_{b-1} \end{bmatrix} = \begin{bmatrix} f(\gamma_0) \\ f(\gamma_1) \\ f(\gamma_2) \\ \vdots \\ f(\gamma_{b-1}) \end{bmatrix}$$

where $f(\gamma)$ is +1 if $\gamma \le 0$ else it is 0, 

Y represents outputs of the first neural layer, 
T are threshold values determined during a training phase, 
w are weighting values determined during the training phase, and 
b is the number of exponent sets in the sequence comprising said data. 

14. A method as claimed in claim 13, wherein the selection stage comprises selecting 
an output $y_a$ of the first neural layer such that $y_a = 1$ and $a$ is maximum for $i < a < b$. 

15. A method as claimed in claim 14, wherein the plurality of exponent coding 
strategies comprises strategies $S_1, S_2, \ldots, S_c$, where $c \le b$, corresponding to differential 
coding limits $1, 2, \ldots, c$. 

16. A method as claimed in claim 15, wherein the exponent coding strategy 
assigned to said first exponent set $E_i$ is selected according to 

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1]$$

where $\sigma(E_i) = \mathrm{floor}((\sum_j |e_{i,j+1} - e_{i,j}|/n) + 0.5)$. 

17. A method for coding audio data comprising a sequence of exponent sets each 
comprising a plurality of exponents, comprising the steps of: 

determining a first variation of exponent values between a first exponent set in the 
sequence and each subsequent exponent set in said sequence; 

selecting an exponent coding strategy for said first exponent set from a plurality of 
exponent coding strategies on the basis of said first variation; and 

coding said first exponent set according to the selected exponent coding strategy. 

18. A method as claimed in claim 17, wherein each of the plurality of exponent coding 
strategies corresponds to a different differential coding limit. 

19. A method as claimed in claim 17, including selecting one of said subsequent 
exponent sets on the basis of said first variation and assigning an exponent re-use coding 
strategy to the selected exponent set and any exponent sets in said sequence between the 
first exponent set and the selected exponent set. 

20. A method as claimed in claim 17, 18 or 19, including a step of determining a 
second variation between consecutive exponents in said first exponent set, wherein the 
exponent coding strategy for said first exponent set is selected on the basis of said first and 
second variations. 

21. A method as claimed in claim 17, wherein the step of selecting the exponent 
coding strategy for said first exponent set is performed utilising neural network 
processing. 

22. A method as claimed in claim 20, wherein the step of selecting the exponent 
coding strategy for said first exponent set is performed utilising neural network 
processing. 

23. A method as claimed in claim 22, wherein the neural network processing 
comprises a feature extraction stage in which said sequence of exponent sets is processed 
to determine said first variation values, a weighted routing stage in which said first 
variation values are weighted according to predetermined weighting values and routed to 
inputs of a first neural layer, a selection stage in which an output of the first neural layer 
is selected, and an output processing stage in which a coding strategy is assigned to said 
first exponent set based on the output of said selection stage and said second variation. 

24. A method as claimed in claim 23, wherein a coding strategy is assigned to at least 
one subsequent exponent set on the basis of the output of said selection stage. 

25. A method as claimed in claim 24, wherein the coding strategy assigned to the at 
least one subsequent exponent set is an exponent re-use strategy. 

26. A method as claimed in claim 23, wherein the feature extraction stage comprises 
determining the first variation values according to 

$$\mathrm{Adiff}(E_i, E_j) = \Bigl(\sum_{m} |e_{i,m} - e_{j,m}|\Bigr) / n$$

where 

Adiff is said first variation, 
$E_i$ is said first exponent set and $E_j$ is a subsequent exponent set with i > j, 
$E_i = (e_{i,0}, e_{i,1}, e_{i,2}, \ldots, e_{i,n-1})$, 
$E_j = (e_{j,0}, e_{j,1}, e_{j,2}, \ldots, e_{j,n-1})$, 
n is an integer representing the number of exponents in a said set of exponents, 
m = 0, 1, 2, ..., n-1. 
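
As an illustration only, the first variation of claim 26 can be sketched in Python as the mean absolute difference between corresponding exponents of two sets. The function name `adiff` and the sample exponent values are hypothetical and do not come from the application.

```python
def adiff(e_i, e_j):
    """Mean absolute difference between two equal-length exponent sets."""
    if len(e_i) != len(e_j) or not e_i:
        raise ValueError("exponent sets must be non-empty and of equal length")
    n = len(e_i)  # number of exponents in a set
    return sum(abs(a - b) for a, b in zip(e_i, e_j)) / n


# Hypothetical exponent sets with n = 4.
first_set = [3, 5, 7, 9]
later_set = [3, 6, 5, 9]
print(adiff(first_set, later_set))  # (0 + 1 + 2 + 0) / 4 = 0.75
```

A small value suggests the later set differs little from the first set; how such values are weighted and thresholded is left to the stages recited in the following claims.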

27. A method as claimed in claim 26, wherein the processing carried out by the 
weighted routing stage and first neural layer includes determining 

the operator is defined as: 

where $f(\gamma_i)$ is +1 if $\gamma_i \geq 0$ else it is 0, 
Y represents outputs of the first neural layer, 
T are threshold values determined during a training phase, 
w are weighting values determined during the training phase, and 
b is the number of exponent sets in the sequence comprising said data. 

28. A method as claimed in claim 27, wherein the selection stage comprises selecting 
an output $y_a$ of the first neural layer such that $y_a = 1$ and a is maximum for i < a < b. 
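
A minimal sketch of the selection stage of claim 28, assuming the outputs of the first neural layer are available as a list of 0/1 values indexed 0 to b-1. The function name `select_output` is hypothetical, and returning `None` when no output equals 1 is an assumption; the claim does not say what happens in that case.

```python
def select_output(y, i):
    """Return the largest index a with i < a < b and y[a] == 1, else None.

    y is a list of 0/1 outputs of the first neural layer (length b),
    i is the index of the first exponent set.
    """
    b = len(y)
    # Scan from the highest candidate index downwards so the first hit is the maximum.
    for a in range(b - 1, i, -1):
        if y[a] == 1:
            return a
    return None  # assumption: no qualifying output found


# Hypothetical outputs for a sequence of b = 6 exponent sets.
print(select_output([0, 1, 1, 0, 1, 0], 0))  # 4
```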

29. A method as claimed in claim 28, wherein the plurality of exponent coding 
strategies comprises strategies $S_1, S_2, \ldots, S_c$, where $c \leq b$, corresponding to respective 
differential coding limits 1, 2, ..., c. 

30. A method as claimed in claim 29, wherein the exponent coding strategy 
assigned to said first exponent set $E_i$ is selected according to 

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1]$$
where $\sigma(E_i) = \mathrm{floor}\bigl((\sum_{j} |e_{i,j+1} - e_{i,j}| / n) + 0.5\bigr)$. 
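
The selection rule of claim 30 can be sketched as follows, assuming the formula reads γ = max[min(a+1, σ(E_i)), 1] with σ(E_i) = floor((Σ_j |e_{i,j+1} - e_{i,j}| / n) + 0.5). The function names are hypothetical, and running the sum over j = 0 to n-2 (so that e_{i,j+1} always exists) is an assumption, since the claim does not state the summation range.

```python
import math


def sigma(e):
    """Rounded mean absolute difference between consecutive exponents of one set."""
    n = len(e)
    # Assumption: j runs from 0 to n-2 so that e[j + 1] is always defined.
    total = sum(abs(e[j + 1] - e[j]) for j in range(n - 1))
    return math.floor(total / n + 0.5)


def strategy_index(a, e):
    """Index of the strategy S_1..S_c assigned to the set, clamped to at least 1."""
    return max(min(a + 1, sigma(e)), 1)


# Hypothetical exponent set: consecutive differences sum to 8, n = 5,
# so sigma = floor(1.6 + 0.5) = 2.
exponents = [0, 2, 4, 6, 8]
print(strategy_index(3, exponents))  # max(min(4, 2), 1) = 2
```

Under this reading, the selected output index a caps the differential coding limit from above, while the variation within the set itself can lower it, and the result never falls below 1.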

31. A digital audio encoder in which audio data is transformed into coefficients having 
mantissas and exponents arranged in a sequence of sets, having: 

a first variation processor coupled to receive the exponents of sets from said 
sequence and determine a first variation of exponent values between a first set and a 
plurality of subsequent sets in the sequence; 

a second variation processor coupled to receive the exponents of said first set and 
determine a second variation between consecutive exponent values within said first set; 
and 

a neural network processor coupled to receive said first and second variations and 
select and assign an exponent coding strategy to said first set from a plurality of coding 
strategies on the basis of said first and second variations. 

32. An audio encoder as claimed in claim 31, wherein each of the plurality of coding 
strategies correspond to a different differential coding limit. 

33. An audio encoder as claimed in claim 31 or 32, wherein the neural network 
processor also selects and assigns an exponent coding strategy to at least one of the 
subsequent sets. 

34. An audio encoder as claimed in claim 33, wherein the exponent coding strategy 
assigned to the at least one subsequent sets is an exponent re-use strategy. 

35. An audio encoder as claimed in claim 31, wherein the neural network processor 
includes a weighted routing stage in which said first variation values are weighted 
according to predetermined weighting values and routed to inputs of a first neural layer, a 
selection stage in which an output of the first neural layer is selected, and an output 
processing stage in which a coding strategy is assigned to said first exponent set based on 
the output of said selection stage and said second variation. 

36. An audio encoder as claimed in claim 35, wherein the first variation processor is 
arranged to determine said first variation according to 

$$\mathrm{Adiff}(E_i, E_j) = \Bigl(\sum_{m} |e_{i,m} - e_{j,m}|\Bigr) / n$$

where Adiff is said first variation, 
$E_i$ is said first exponent set and $E_j$ is a subsequent exponent set with i > j, 
$E_i = (e_{i,0}, e_{i,1}, e_{i,2}, \ldots, e_{i,n-1})$, 
$E_j = (e_{j,0}, e_{j,1}, e_{j,2}, \ldots, e_{j,n-1})$, 
n is an integer representing the number of exponents in a said set of exponents, 
m = 0, 1, 2, ..., n-1. 

37. An audio coder as claimed in claim 31 or 36, wherein the weighted routing stage 
of the neural network processor is arranged to determine 

the operator $\Gamma[\cdot]$ is defined as: 

where $f(\gamma_i)$ is +1 if $\gamma_i \geq 0$ else it is 0, 
Y represents outputs of the first neural layer, 
T are threshold values determined during a training phase, 
w are weighting values determined during the training phase, and 
b is the number of sets in the sequence. 



38. An audio encoder as claimed in claim 37, wherein the selection stage comprises 
selecting an output $y_a$ of the first neural layer such that $y_a = 1$ and a is maximum for 
i < a < b. 

39. A method as claimed in claim 38, wherein the plurality of exponent coding 
strategies comprises strategies $S_1, S_2, \ldots, S_c$, where $c \leq b$, corresponding to respective 
differential coding limits 1, 2, ..., c. 

40. A method as claimed in claim 39, wherein the exponent coding strategy 
assigned for encoding exponents in said first set $E_i$ is selected according to 

$$\gamma = \max[\min(a+1, \sigma(E_i)), 1]$$
where $\sigma(E_i) = \mathrm{floor}\bigl((\sum_{j} |e_{i,j+1} - e_{i,j}| / n) + 0.5\bigr)$. 